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Abstract. We present techniques used to estimate the backgrounds in the search for the Standard Model Higgs 
boson in the H — > WW^ — > tvtv decay channel with the ATLAS experiment at the LHC. The dataset corre- 
sponds to 13 fb _1 of integrated luminosity taken at a center of mass energy of 8 TeV. Only the final states with 
an electron, muon, and zero or one jet are presented here. 



1 Introduction 

In July 2012 the ATLAS 1 1 ] and CMS [ 3 ] experiments at 
the LHC announced the discovery of a new particle consis- 
tent with the long-sought Higgs boson (HO. The results 
presented here constitute an update of the H — > WW^ — > 
tvtv analysis with a dataset of 13 fb" 1 taken at a center of 
mass energy of 8 TeV Q. In particular, we summarize the 
methods of background estimation for this search channel, 
which focuses on the low mass Higgs signal region. 

The WW^ — > tvtv decay channel of the Higgs bo- 
son has a final state defined by two leptons and miss- 
ing transverse energy (£"™ 1SS ) from neutrinos which es- 
cape detection. The analysis presented here considers only 
the final states with one electron, one muon, and zero 
or one jets with transverse momentum (p T ) greater than 
25 GeV. The leptons are required to be isolated and the 
leading (subleading) lepton must have p? > 25 (15) GeV. 
Additionally, the event must have relative missing trans- 
verse energy (E^ { ) greater than 25 GeV, where E™*^ = 
sin(min(A0, -)) and A0 is the azimuthal angle be- 
and the nearest reconstructed lepton or jet. 
This definition helps to reject events where a mismeasure- 
ment of one of the reconstructed objects is a major source 
of the £™ 1SS . After these pre-selection cuts, the signal re- 
gion is divided into zero and one jet bins and additional 
topological cuts (specific to each bin) are applied to dis- 
criminate the Higgs signal from background contributions. 

Many processes in the Standard Model (SM) produce 
final states similar to that in H -> WW^ -> tvtv. The 
largest background contribution is the irreducible SM WW 
background. The next largest background contributions 
come from tt and single top production. These back- 
grounds are primarily relevant for larger jet multiplicity 
bins but are also present in the zero jet bin. Another im- 
portant background for the analysis is the W+jets back- 
ground. Here a single W boson is produced in association 
with one or more jets, and one of the jets fakes a final state 
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Figure 1. The jet multiplicity distribution after signal pre- 
selection cuts are applied |2|. 



lepton. Other backgrounds which will not be discussed in 
great detail include the Z+jets and diboson (WZ, ZZ, Wy) 
backgrounds. Figure [T] shows the jet multiplicity distribu- 
tion before the selection separates the events into jet multi- 
plicity bins. After the pre-selection, the WW background 
is the dominant background in the zero jet bin while the 
top background dominates the higher jet multiplicity bins. 



2 W+jets and Other Minor Backgrounds 

The W+jets background arises from SM W boson produc- 
tion in association with jets where one jet produces an 
object reconstructed as a lepton. This can be a real lep- 
ton produced by heavy quark decay or a product of the jet 
fragmentation that is incorrectly reconstructed as an elec- 
tron. The W+jets background contribution is estimated 
by a data-driven method called the "fake factor" method. 
First, a control region is defined in data by requiring one 
lepton with the same identification and isolation criteria as 
the signal leptons. The second lepton is required to be anti- 
identified, satisfying loosened isolation criteria and failing 
at least one identification requirement. These events are 
then required to pass the full signal selection. 
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A fake factor, the ratio of the number of lepton can- 
didates passing all identification requirements and signal 
selections to the number that are anti-identified, is derived 
in an inclusive data dijet sample. This factor is used to 
scale the number of events in the control region to the sig- 
nal region. The total relative uncertainty on the estimate is 
50%, dominated by the systematic uncertainty on the fake 
factor. 

Figure[2]shows the transverse mass (m?) in a same sign 
validation region (where two same sign rather than two 
opposite sign leptons are required). This region is com- 
posed largely of W+jets and WZ/ZZ/Wy backgrounds and 
is used to validate the modeling of kinematic variables for 
these samples. This region shows that the m? is well mod- 
eled (within statistics) for these samples. 
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Figure 2. m T in the zero jet same sign validation region | 2|. 

Here we briefly mention other minor backgrounds 
which will not be discussed in further detail. First, the 
Z+jets background comes from a case where the Z decays 
to two leptons and there is fake missing transverse energy 
in the event due to the calorimeter resolution. This back- 
ground is normalized to data in a control region requir- 
ing ma < 80 GeV and A<pa > 2.9. Finally, the normal- 
izations for the remaining backgrounds (WZ/ZZ/Wy) are 
taken from Monte Carlo simulation. 
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Figure 3. m T distribution in the top one jet control region. The 
rate predicted by Monte Carlo simulation has not yet been nor- 
malized to the data. Q 

3 tt and Single Top Backgrounds 

The tt and single top backgrounds are normalized together 
in control regions (CR) separated by jet multiplicity. The 
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Table 1. Total uncertainties on backgrounds normalized using 
simple NF scaling in CRs |2|. 



one jet bin is normalized in a CR defined with the same 
pre-selection as the signal region (SR) and at least one b- 
tagged jet. For the zero jet bin, there are two CRs used 
for the background estimate. First, a CR with only the SR 
pre-selection is used to estimate the fraction of top events 
passing a jet veto. A second, b-tagged CR is then used 
to estimate the probability of having no other jets recon- 
structed in the event and is used as a correction to the frac- 
tion estimate from the first CR. 

Figure [3] shows the m T in the one jet CR before any 
normalization factors are applied. The normalization fac- 
tors (NF), or ratio between the data and Monte Carlo pre- 
dictions, derived via these methods are 1.04 ± 0.05 (stat.) 
for the zero jet channel and 1.03 ± 0.02 (stat.) for the one 
jet channel. 

4 Standard Model WW Background 

The SM WW background is estimated in a CR which uses 
the SR pre-selection cuts (two leptons, missing transverse 
energy) and is separated into jet bins. While the SR re- 
quires ma < 50 GeV, the WW CR requires m u > 80 GeV. 
This is the largest background in the signal region. The 
WW modeling in simulation is done with a tune of Powheg 
for event generation and Pythia 8 for parton showering. 

Figure]?] shows the mr in the WW zero and one jet CR, 
before the application of any WW NF. In both the zero 
and one jet (but particularly in the one jet) WW CRs, there 
is a non-negligible contrbution from the top backgrounds. 
Therefore, the top backgrounds are first normalized using 
the procedures described in Section[3]before all of the non- 
WW backgrounds are subtracted from the event yields in 
the CR to derive the final normalization. The ratio of the 
data (with non WW background subtracted) to the WW 
simulation prediction is 1.13 ± 0.04 (stat.) in the zero jet 
CR and 0.84 ± 0.08 (stat.) in the one jet CR. The NF differ 
between the zero and one jet channels because these CR 
are correcting for the over-prediction of the jet multiplicity 
distribution by the current Powheg+Pythia 8 tune used by 
ATLAS. 

5 Background Predictions and 
Uncertainties 

Table [T] shows the total uncertainties on the background 
normalization for the backgrounds which use the sim- 
ple data to simulation scaling in the CR for normaliza- 
tion. Theoretical uncertainties on the estimates include 
differences due to the choice of generator and parton 
shower/underlying event as well as other contributions. 
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Figure 4. The mj distribution in the WW zero (one) jet CR is 
shown in the top (bottom) plot. The WW rate predicted by Monte 
Carlo simulation has not yet been normalized to the data. The top 
backgrounds have been normalized according to the procedure 
described in the text Q. 
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Table 2. Normalization factors (NF) for all backgrounds whose 
normalizations are taken from data Q. 



in the zero jet bin, while both WW and top are dominant 
in the one jet bin. In zero jet, there is a total of 774 ± 9 
(stat.) expected background events, and 555 ± 5 (stat.) of 
those are SM WW events. In the one jet, out of 386 ± 5 
(stat.) total expected background events, 1 18 ±2 (stat.) are 
SM WW events while 134 ± 5 (stat.) are tt events. The dif- 
ference between the background expectation and the 917 
(433) events observed in data in the zero (one) jet bin are 
due to the presence of the Higgs-like signal. 
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Figure 5. m T distribution in the zero (one) jet bin after all signal 
selection cuts in the top (bottom) plot |2|. The expectation for a 
125 GeV Higgs signal is shown in red. 



The experimental uncertainties are dominated the jet en- 
ergy scale and resolution and, in the one jet bin, the b 
tagging efficiency. The "Crosstalk" column refers to un- 
certainties on other backgrounds which must be subtracted 
from the CR before the normalization of the desired back- 
ground can be computed. Notice in particular that the 
WW one jet normalization has a large contribution from 
crosstalk due to the fact that a top background contribution 
must be subtracted from the CR before the normalization 
is computed. 

Table |2] shows the NF derived for all of the back- 
grounds whose normalizations are taken from data. In the 
case of everything except the top zero jet background, this 
factor is simply the ratio of the number of Monte Carlo 
events to data events in the appropriate CR for that back- 
ground. We can see that most of the backgrounds do not 
require very large corrections to their normalization (none 
more than 16%). 

Figure [5] shows the m? distribution after the signal se- 
lection cuts have been applied for the zero and one jet bin. 
It can be seen here that the WW background is dominant 



6 Conclusion 

A wide array of estimation methods can be employed 
to understand the complicated background processes that 
factor into a search for a H —> WW^ — > tviv signal. Sim- 
ple data to simulation scaling in control regions is used 
for backgrounds such as SM WW (or top in the one jet 
bin) where the variable shapes are well modeled but their 
normalizations may be incorrect. More complicated data- 
driven methods, such as the W+jets fake factor method, 
can also be used when the backgrounds are not well mod- 
eled by simulation alone. 
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